Assessing Inter-rater Reliability With Heterogeneous Variance Components Models: Flexible Approach Accounting for Contextual Variables

Abstract

Inter-rater reliability (IRR), which is a prerequisite of high-quality ratings and assessments, may be affected by contextual variables, such as the rater’s or ratee’s gender, major, or experience. Identification of such heterogeneity sources in IRR is important for the implementation of policies with the potential to decrease measurement error and to increase IRR by focusing on the most relevant subgroups. In this study, we propose a flexible approach for assessing IRR in cases of heterogeneity due to covariates by directly modeling differences in variance components. We use Bayes factors (BFs) to select the best performing model, and we suggest using Bayesian model averaging as an alternative approach for obtaining IRR and variance component estimates, allowing us to account for model uncertainty. We use inclusion BFs considering the whole model space to provide evidence for or against differences in variance components due to covariates. The proposed method is compared with other Bayesian and frequentist approaches in a simulation study, and we demonstrate its superiority in some situations. Finally, we provide real data examples from grant proposal peer review, demonstrating the usefulness of this method and its flexibility in the generalization of more complex designs.
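
The quantities named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a simple design with only a ratee variance and a residual variance, and it takes the posterior model probabilities as given rather than estimating them. All function names and numbers below are hypothetical.

```python
# Illustrative sketch only; names and numbers are hypothetical, not from the paper.

def irr(var_ratee, var_residual):
    """Single-rating IRR: share of total variance attributable to ratees."""
    return var_ratee / (var_ratee + var_residual)

def model_average(estimates, post_probs):
    """Average per-model estimates, weighted by posterior model probabilities."""
    total = sum(post_probs)
    return sum(e * p for e, p in zip(estimates, post_probs)) / total

def inclusion_bf(prior_probs, post_probs, includes):
    """Inclusion Bayes factor: posterior inclusion odds over prior inclusion odds.

    `includes[i]` is True when model i lets the variance components
    differ by the covariate. Probabilities are assumed to sum to 1.
    """
    prior_in = sum(p for p, inc in zip(prior_probs, includes) if inc)
    post_in = sum(p for p, inc in zip(post_probs, includes) if inc)
    return (post_in / (1 - post_in)) / (prior_in / (1 - prior_in))

# Hypothetical variance components for one subgroup (e.g. internal applicants)
# under two models: M0 shares variances across groups, M1 lets them differ.
irr_m0 = irr(var_ratee=1.0, var_residual=1.0)
irr_m1 = irr(var_ratee=1.5, var_residual=1.0)

prior = [0.5, 0.5]          # equal prior model probabilities (assumption)
posterior = [0.25, 0.75]    # made-up posterior model probabilities

averaged_irr = model_average([irr_m0, irr_m1], posterior)
bf_incl = inclusion_bf(prior, posterior, includes=[False, True])
print(f"model-averaged IRR: {averaged_irr:.3f}, inclusion BF: {bf_incl:.1f}")
```

An inclusion BF above 1 favors models in which the variance components differ by the covariate, and a value below 1 favors models in which they do not. With more than two candidate models, the same functions apply once `includes` marks every model that contains the covariate effect.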

Related resources

Variance Estimation of Nominal-scale Inter-rater Reliability with Random Selection of Raters

Most inter-rater reliability studies using nominal scales suggest the existence of two populations of inference: the population of subjects (collection of objects or persons to be rated) and that of raters. Consequently, the sampling variance of the inter-rater reliability coefficient can be seen as a result of the combined effect of the sampling of subjects and raters. However, all inter-rater...

[Brief Suicide Questionnaire. Inter-rater reliability].

INTRODUCTION: Inter-rater agreement is a crucial aspect in the planning and performance of a clinical trial in which the main assessment tool is the clinical interview. The main objectives of this study are to study the inter-rater agreement of a tool for the assessment of suicidal behavior (Brief Suicide Questionnaire) and to examine whether the inter-examiner agreement when multiple ratings ar...

Inter-rater Reliability of Videofluoroscopic Dysphagia Scale

OBJECTIVE: To investigate the inter-rater agreement using the Videofluoroscopic Dysphagia Scale (VDS). METHOD: The present study was designed as a multicenter, single-blind trial. A Videofluoroscopic Swallowing Study (VFSS) was performed using the protocol described by J.A. Logemann. Thick-fluid, pureed food, mechanically altered food, regularly textured food, and thin-fluid boluses were sequent...

Inter-Rater Reliability of the CASCADE Criteria

See related article, p 2439. Childhood arterial ischemic stroke (AIS) has received increasing attention as a cause of morbidity and mortality in the pediatric population. Several population-based studies in the United States demonstrate an incidence of childhood AIS at 2 cases per 100 000 children per year, with an incidence in other studies from Asia and Europe ranging from 0.2 to 7.9 per 100 ...

Computing Inter-Rater Reliability With the SAS System

The SAS system V.8 implements the computation of unweighted and weighted kappa statistics as an option in the FREQ procedure. A major limitation of this implementation is that the kappa statistic can only be evaluated when the number of raters is limited to 2. Extensions to the case of multiple raters due to Fleiss (1971) have not been implemented in the SAS system. A SAS macro called MAGREE.SA...

Journal

Journal title: Journal of Educational and Behavioral Statistics

Year: 2023

ISSN: 1076-9986, 1935-1054

DOI: https://doi.org/10.3102/10769986221150517